Parallel Genetic Algorithms for Tuning a Fuzzy Data Mining System

نویسندگان

  • QITAO LIU
  • SUSAN M. BRIDGES
  • IOANA BANICESCU
چکیده

In previous work, we have described methods that we have developed for tuning a fuzzy data mining system for intrusion detection using a hierarchical genetic algorithm. Unfortunately, the genetic algorithm approach is very slow due to the computational cost of the evaluation function. In this paper, we describe parallel implementations of the genetic algorithm that were run on both a multiprocessor Unix workstation and a high performance cluster running Linux. Very little speedup was achieved on the Unix workstation because of contention for the single file system; significant speedup was achieved with the cluster in which each node had its own file system. Experimental results of our implementations based on the master-slave parallel model and multiple population parallel model are compared, and the preliminary results indicate that the multiple population model may provide a higher quality solution for a shorter amount of time. INTRODUCTION Kuok, Fu, and Wong (1998) introduced the marriage of fuzzy logic and association rule mining to address the sharp boundary problem encountered when discretizing continuous attributes for association rule mining. We have further refined these techniques for intrusion detection applications (Luo and Bridges 2000). One of the difficulties encountered in this approach, however, is defining appropriate membership functions for the fuzzy sets that serve as values of the fuzzy attributes. We have subsequently found that genetic algorithms (GAs) are effective methods for tuning the fuzzy membership functions and for feature selection (Shi 2000; Bridges and Vaughn 2000). Unfortunately, the evaluation function for the genetic algorithm requires that data mining be performed for every member of the population at every iteration of the genetic algorithm. Although, in the intrusion detection domain, the tuning process is an off-line operation, the time required for the GA is very substantial. In order to improve the performance of the GA, we have investigated parallel implementations of the genetic algorithm for data mining on both a multiprocessor Unix workstation and a high performance cluster running Linux. In the remainder of this paper, we first describe our fuzzy data mining process and the sequential genetic algorithm that we have developed for tuning

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms

In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...

متن کامل

Development of an evolutionary fuzzy expert system for estimating future behavior of stock price

The stock market has always been an attractive area for researchers since no method has been found yet to predict the stock price behavior precisely. Due to its high rate of uncertainty and volatility, it carries a higher risk than any other investment area, thus the stock price behavior is difficult to simulation. This paper presents a “data mining-based evolutionary fuzzy expert system” (DEFE...

متن کامل

Identification of Cement Rotary Kiln in Noisy Condition using Takagi-Sugeno Neuro-fuzzy System

Cement rotary kiln is the main part of cement production process that have always attracted many researchers’ attention. But this complex nonlinear system has not been modeled efficiently which can make an appropriate performance specially in noisy condition. In this paper Takagi-Sugeno neuro-fuzzy system (TSNFS) is used for identification of cement rotary kiln, and gradient descent (GD) algori...

متن کامل

SECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS

In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of non- linear system identi cation, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, se- curing low-level and high-level ...

متن کامل

On the mining of fuzzy association rule using multi-objective genetic algorithms

The discovery of association rule acquire an imperative role in data mining since its inception, which tries to find correlation among the attributes in a database. Classical algorithms/procedures meant for Boolean data and they suffer from sharp boundary problem in handling quantitative data. Thereby fuzzy association rule (i.e., association rule based on fuzzy sets) with fuzzy minimum support...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001